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The field of the invention relates to color image processing and in particular to a system and method for 
finding ares of similar color in electronically captured spot color images and replacing the similar colors with 
a single dominant color. 

Several current computer desktop publishing and image processing systems permit users to create 

5 and/or modify synthetic graphical images and text. In the latter case, the image is retrieved directly form 
memory to a display device for the user to view the image and perform image processing routines that are 
supported by the system. Prior to being able to retrieve the image from memory, however, the image must 
be scanned using an image scanning device (scanner). Image scanners convert an original image, from a 
document for instance, into raster format of a predetermined resolution in dots per inch (dpi). 

io Any scanner, or almost any electronic device that captures image data will take in an image as either a 
greyscale value from 0 to 255, or a color value typically split into red, green and blue (RGB), with the 
intensity of each color being represented by some numeric range from 0 to 255, for example. Images 
represented in this way are called usually continuous tone or "contone" images. 

Contone therefore refers any process that assigns a range of values to each pixel. A greyscale image is 

75 a contone image, because it contains a range of values from 0 to 255, in the most common case. An RGB 
image is typically contone because each of the colors is represented by a range from 0 to 255. 

Black and white contone is the simplest case; for greyscale, 0 is full black, and 255 is full white. Color 
contone is more complicated, but the same principal holds: each color is assigned 3 bytes, each byte 
represents R, G, or B and the magnitude of corresponding byte represents the intensity of that color. 

20 A contone image can be thresholded using a simple threshold method. A simple threshold method will 
translate a greyscale value above 128 to white, and a value below 127 to black. Thresholding for color 
images to produce a color image is somewhat different. 

Color thresholding is more commonly called "posterization". Posterization is a technique for extracting 
a pallet image from a true color image. A true color image, is an image that contains components of red, 

25 green and blue. Typically, there are also other color spaces, but red, green and blue is the most common. 
As discussed above, each RGB color has a value between 0 and 255 associated with it. 

In this case, the image would actually have three bytes of information, (e.g., three bit components). One 
being red, one being green, and one being blue. Each pixel would have those three bytes associated with it 
to describe the color represented by that pixel. Alternatively, a pallet image would have one pallet index per 

30 pixel. The pallet index is an index into a pallet of values for red, blue and green. 

The process of posterization is basically one of taking that true color image and converting it to the 
closest match in a pallet by substituting the RGB pixel with the pixel in the pallet, or in the case of 
converting it to a pallet image, to replace it in the index. The true color image is then converted to a color 
image that is made up of indexes into a pallet (a pallet image). Posterization is therefore the process of just 

35 mapping RGB colors into that pallet, and the resulting image will then have only those colors in it. 

A "spot color" image, in its purest form, is an image that has been printed by an offset printing press 
where each color in that image is reproduced by a separate print roll and ink. Each color is unique and is 
solid. The final image is comprised solely of spots of color. 

Consider a spot color image including six colors, orange, yellow, green, red, blue and black; each being 

40 solid and printed on a card stock. In order to produce this original image, it would have been run through 
six separate presses, each one laying down one or more solid areas of a specific color. 

This is different from a conventional "half-toning" process that is used to produce contone images, or 
photo-like images for magazines and books. Half-tone images are produced using four presses; one each 
running cyan, yellow, magenta and black, respectively. Such a system permits creating images with, color 

45 combinations by doing half-toning with those various colors. But the colors are not solid, because they 
contain patterns in them. Thus, the half-toning process is not a spot color process. 

Consider a photographic image reproduced using a half-toning process. Looking closely at the half- 
toned image one will see that it is actually made up of a series of pattern of dots. Basically, the dots will 
vary in size in order to try and represent a photographic type image. It is actually at a much coarser level 

so then the original photograph. Black and white half-toned images do the same thing by varying the number 
of molecules that are converted to a certain color. Black and white half-toned images look very smooth, so 
do color half-toned images. It is because they have very fine granularity. On an offset press, the grain is 
larger. Looking at a newspaper, the grain is poor and one can actually see the patterning very clearly. The 
patterning being a regular grid of dots, and the dots varying in size to represent different intensities, which 

55 gives the photographic term, half-tone. 

When a spot color image is scanned and input to a computer, the resulting colors actually comprise a 
variety of colors. This loss of true color is the result of noise in the system: from the inconsistency of the 
printing process and the scanner itself. There may be a great deal of variance in each of the color 
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components. For instance, in an RGB scanner, the change over a range of from 0 to 255 maybe * 10 At 

7^llTJl e T T^ 6 *! 0 20 VariantS> 20 X 20 X 20 ' 0r 8 - 000 different di 9»*' color shades may 
look like one single color to the naked eye. y 

Users commonly load a scanned spot image into a paint package or presentation package to edit it or 
5 change colors. The scanned image as it appears on the CRT may still look like a spot color However the 
scanned spot ,mage w.ll inevitably include many undetectable shades of color Several commercial 
available packages include functions to change colors that appear on the screen commercially 

tn, t F °;,f Xamp ' e ' * e paint bucket tools wnich are common in Macintosh® parint packages. The user may 
try to fill a particular spot color object with a new color. Because there may actually be 8000 Zvent 
10 colors, using the fill function may yie.d no noticeable result. Thus, short o editing ever single™ 
Deski " JeC ! C3 T b8 8dited - PC Paintbrush@ is a P^uct available for IBM^ased systems a d 
2£?!L P T PfE? aVaHab,e fr ° m Z9dC0r ,nC " Which operates on *• Maci "^h, both packages 
fi! a certain Zt T, * ^i*™- DeSkpaM * m ° re so P hi ^ed in that it allows the use? to 

In! h 9 !- ♦ 0WeVer ■ rt ' S n0t 6ffeCtive because the ra "9 e is ver V dif ficult to control and the 

is image becomes distorted due to problems in the transitions between colors 

C0 | O r rh On e thl e ^ a9eS H tha i are halW0n8 3nd imag6S * at are Ph^a^Phs that users perceive as being spot 
™ *,t?l h th9re ^ C0 '° r drawin9S Which wou,d be the eQ - uivalent ° f spot color, images that 

folf m P k d °" S8Parate rUnS ^ dothers that « P^uced with half-tones of colors. Cofor ha.f- 
tones would be createo witn a hart-tone process on the press, usually intended for contone, but color 
20 drawings are often rendered with color half-tones. Finally, there is the color photograph 

oeneri! 1 th?!r r ha " d ' f ° nS wWch haS bee " Created ^P 1 ' 03 '^ bv a computer. There are 

generally three ways of creating an image. One would be nature, like taking a picture of a person or a 

EETla2 H " ■ ™™l im396 - * * USUa " y 3 C ° nt ° ne im3ge - The next source ' ould * a hand-c eld 
image, hand drawings and paintings, for example. Typically, some of these can be synthetic images or 

25 rs.;;:r third c,ass wou,d be a « zi s 

inn,?I n a Umber 1 ° f differen * co,ors that are in an image goes down considerably from real life, which 

numht n 3 T " ° f rea ' CO, ° r Vanati0nS ' t0 3 ^"'helically generated image in which the 

number of colors ,s significantly reduced. The number of real colors will rarely be above ten in a synthetic 
30 image. Usually a synthetic image would include four or five colors syntnetic 

nnti^Th?'; I" & ° f 3 3 "° house V*™*** ** * computer. One immediately 

T£?£Zf£r* f r' id Wa " S ^ ' itt,e teXtUring - A,S °- there is no noise in »• w "ich is one 
Inl T ^ fe a0d thS fee,ing ° f r63lity to natural ima 9 es - 0b i ects in the real world are not truly 

smooth. Objects that are synthetically generated are typically very flat and smooth 

35 rniJl 6 TT and meth ° d ° f * he PreS6nt invention (the present invention) are for finding ares of similar 
color ,n electronically captured spot color images and replacing the simi.ar colors with a single domlnan! 

«. Jronv jl,nCti0n T ? th PreS8nt inVenti0 " 3n 0rigin31 ima " is first scanned * low resolution to generate an 

. T"'- I" St6P 18 Ca,,ed th8 Pr ° SCan Step " Using information ,rom * h e prescan step the system 

to classifies colors in the generated image using a three dimensional (3-D) histogram. The dimensions oHhe 
histogram are the red. blue and green (RGB) colors of the scanner. However, the dimensions of Te 
Snn thTh- ? mprise | 0the / known ima 9° coordin ate systems. A pallet of dominant colors is identified 
fLh T m aSS ' ,Cat, ° n - ThS C0 ' 0r P3,,et iS USed to identif y *• dominant colors in the image. 

« ^MhSn^i „ I tn l or, 9'na' -mage is scanned at a second resolution, the second resolution being 
s substantially higher than the first resolution. The scanned lines are then processed to determine where 

mTmo^ • " e ST C0 ' 0rS eXiSt " Th9 '° Cati0nS ° f the dif,erent co, °' s ar « noted ^ storeS n 
memory), using a break out box comprising a set of rules to evaluate each pixel 

if thJn^H" 18 ,° f ea ° h - ,ine ar ? correlated witn each otner and with segments of adjacent lines to determine 
o r r S ' m,,ar '° Cati0nS t0 th8reby identify various spatial areas of color representing Z 

SP ^ ^ f,na " y C ° mpared With the pa,let of colors a "d the average noted color oi the 
spatial areas are replaced with the closest corresponding pallet color 

FlS^hol^ hl^ b f H r UnderS * 00d if reference is made to the accompanying drawings in which: 
FIG. 1 shows a high level diagram of a hardware system to implement the present invention- 

p t 3 k ° h ' eVel ,,0W Ch3rt ° f Sp0t CO, ° r extracto n in conjunction with the present invention; 

s - 3 A '° show 3 spot color image and three color profiles of the image- 
FIG. 4 shows a 3-D RGB histogram in conjunction with the present invention- 

inventio 5 n A ^ B ^ d8PiCtin9 3 " ° Perati ° n f ° fi " d COl0rs in injunction with the present 
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FIG's. 6 A and B show a color cloud in conjunction with the present invention; 

FIG. 7 shows a flow chart depicting an operation to find color areas in conjunction with the present 
invention; H 11 

FIG's 8 A and B show features of the break out box in conjunction with the present invention- 
s FIG. 9 shows an area matching FIFO in conjunction with the present invention; 

FIG. 10 shows a flow chart depicting an operation for correlating color areas in conjunction with the 
present invention; 

FIG. 11 shows the creation of color paths in conjunction with the present invention; 
FIG. 12 shows a flow chart depicting an operation for updating data structures in conjunction with the 
10 present invention; 

FIG. 13 shows vertical jaggy removal in conjunction with the present invention; 

FIG's 14 A and B show horizontal skew correction in conjunction with the present invention; 

FIG's. 15 A and B show vertical skew correction in conjunction with the present invention; ' 

FIG. 16 shows noise removal in conjunction with the present invention; and 

75 FIG. 17 shows a flow chart depicting an image output operation in conjunction with the present invention 

In conjunction with the present invention images are processed on a line-by-line basis. Each line has 

some number of pixels horizontally which are referred to as the width. The number of lines are referred to 

_ 5 *?J^J*J^^.^ the hei 9 ht in P«els (or the number of pixels horizontally and the number of 
lines) aescnbe the dimensions of the image in pixels. .„ ^ ^, . ^. , ,. , 

20 In the horizontal direction, the width in bytes is also keep track of, the idea being that the image can 
fully be described given the number of pixels horizontally, the number of lines, the width and bytes of the 
image* 

There are several different types of images that are used by the present invention. Black and white 
images can be processed and output as one bit per pixel, based on greyscale images input as 8-bit words 
25 i.e., one pixel per byte. 

In addition the system processes 2 RGB data formats. One is 24-bit where there is a red, a green and 
a blue byte. The other one is 32-bit where there is an alpha byte, a red, a green, and a blue. The alpha'byte 
is actually unused. The 32-bit scheme is convention on the Macintosh®, and permits loading and storage of 
RGB pixels as one 32-bit integer to speed-up processing. These type of data are processes and output as 
pallet images. A pallet image comprises an 8-bit per pixel image, thus, each byte represents one pixel As 
discussed above, each 8-bit word is an index into a pallet of colors. 

All example of hardware system for implementing the present spot color extraction invention is shown in 
Fig. 1. The computer platform 102 includes hardware units 110, including a central processing unit (CPU) 
114, a random access memory (RAM) 112, and an input/output interface 116. The RAM 112 is also called a 
35 main memory; 

The present invention is depicted as a process 130 for spot color extraction of images scanned by a 
scanner 134 and stored on a data storage device 124. The data storage device 124 is also called a 
secondary storage and may include hard disks and tape drives. The data storage device 124 represents 
non-volatile storage. oom=> 

<° The present spot color extraction invention may be implemented in software or directly in hardware 
The present invention could also be implemented in a combination of software and hardware, by optimizino 
specific functions in hardware to increase efficiency of the overall system 

The computer includes as a display 120, a keyboard 121, a mouse 122 and a printing device 128 
Users 132 interact with the computer and the spot color extraction process 130 via the display 120 the 
45 keyboard 121, and the mouse 122. 

Examples of systems that support the present invention include the Macintosh® Model II SE and Plus 
personal computers families, as well as most IBM PC compatible 286, 386 and 486 models and their 
equivalents. 

A high level overview of the present invention will now be discussed with reference to FIG 2 A preview 
scan 202 is performed to obtain a low-resolution image of the original to be spot color extracted. This image 
is 8-bit greyscale or 24-bit true color. 

Using the information from the preview scan 202 the system and method of the present invention find a 
pallet of colors using a 3-D histogram (see step 204). Finding the dominant colors is accomplished by 
creating clusters" of colors that are related. This is done by taking the maximum occurrence and finding 
the colors around that maximum occurrence that are related and including the related colors in the color 
cluster. 

one N c 8xt a ? 9h resoluti <> n scan of each line is obtained in 8-bit greyscale or 24-bit color, as shown at step 
206. From this point the lines are processed individually so that the following step, step 208, can find areas 
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on each current line that contain the same or similar colors, peak colors and color transitions. 

The areas of the line identified in step 208 are correlated with each other and areas of adjacent lines, as 
shown in step 210. 

A data structure is created and updated which will represent outlines of spot areas determined in the 

5 previous steps. (See step 212.) The system then determines whether additional lines must be processed, as 
shown in the conditional statement at step 214. At this point, if no more lines are to be processed, the 
system then prepares to output the spot color extracted image. If further lines are to be evaluated, the 
system jumps back to begin processing the additional line(s), as shown at step 206. 

Three additional process steps are optionally carried out by the system, as shown at steps 216-220. 

10 Line "jaggies" are removed from the image described in the data structure, as shown at step 216. At step 
218 skew anomalies are removed, and at step 220 noise is removed. The filtered or non-filtered spot color 
extracted image is then output, as shown ar step 222. 

The above steps of the spot color extraction system and method of the present invention will now be 
described separately in further detail. 

75 The preview scan is a relatively straight forward and conventional step which permits the user to 
preview the entire image or portion of the image. The preview scan is performed at low resolution in 24-bit 
color or 8-bit greyscale at about 37 dpi. The preview scan generates an image the user can view on the 
screen. Using well known techniques, the user is permitted to select an area which he may want to perform 

" " the spot color extraction on. Alternatively, the entire prescanned spot image may be selected. 

20 The dpi varies based on the size of the preview area and whether the user chooses a complete or 
partial high resolution scan of the previewed area. The low resolution scan may vary anywhere from 37 dpi, 
which is a good initial preview scan, up to the scanners maximum scan rate. As will be discussed below, 
the 3-D histogram will throw out a percentage of data to determine the pallet of colors; hence, a large dpi 
for the low resolution scan is unnecessary. 

25 The software or hardware for implementing the preview scan is standard, and is typically provided to 
users by the scanner manufacturer, or second source vendors. Products such as Scanning Gallery 2.0, 
Scanning Gallery 5.0 for IBM-based machines and Deskscan 1.0 for the Macintosh®. (These three 
packages are products of Hewlett-Packard Company, 700 71 st Avenue, Greeley, CO 80634.) These and 
similar packages will permit the user to select image areas using a mouse and the cursor to create a 

30 selection box around the area to be scanned at high resolution. 

Prior to discussing the 3-D color histogram, a 2-D gray-scale histogram will briefly be explained. A 2-D 
histogram includes one graph along the X-axis, for example, with values ranging from 0 to 255. These are 
the intensity of the greyscale values from black to white, respectively, along the X-axis. Along the Y-axis, 
would be the number of occurrences of each of those intensity values. For instance the plot of a greyscale 

35 image of a white page filled with black text would yield a 2-D histogram with a large clump near 0, 
representing the black text, and large clump near 255 representing the white background. 

As another example, a 2-D histogram of a greyscale image photograph of some real life object would 
have a range of intensities spread over entire X-axis based on the content of that image. If it was a dark 
image, there would be more towards the black, and a lighter image would contain more occurrences be 

40 more towards the white end of the scale. The 2-D histogram of a complex image may look like a mountain 
range due to the many different shades of grey. Thus, the histogram charts the number of occurrences of 
each intensity level. 

A 3-D color (RGB) histogram has three axes, red, green, and blue, respectively, each ranging in 
intensity for 0 to 255. Thus, at the origin (0,0,0), each of the three intensities are 0 and the corresponding 
45 color is black. At the far corner of the 3-D RGB histogram is the point (255,255,255), corresponding to the 
color white. 

A black and white representation of an original spot color image is shown in FIG. 3A. The outside 
border 302 is red, the background 304 of the words "COLOR SCANNING" is yellow, the lettering 306 is 
blue and the letter shadowing 308 is black. The lines 310 indicate a slice taken through the image to show 

50 the color profiles of FIG's. 3B-3D. FIG's. 3B-D represent color profiles of the slice corresponding to the 
colors red, green and blue, respectively. 

The color intensity of a border section 312 is also shown in FIG's. 3B-D at sections 314. Because the 
border section 312 is primarily red, the intensity of the red profile is high, the intensity of the green profile is 
very low, and the intensity of the blue profile is substantially zero. The color intensity of a background 

55 section 314 is also shown in FIG's. 3B-D at sections 314. Because the background section 314 is primarily 
yellow, the intensity of the red profile is high, the intensity of the green profile is high (i.e., red and green 
make yellow) and the intensity of the blue profile is substantially zero. 

A section 316 of the letter "C" is shown in FIG's. 3B-D to have very little red, some green and a high level 
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of blue. 

inn- Fi t na I h y " a K, b ' a ^ b ! rder SeCti0 " 318 iS 3,30 depicted in the three P rofi,es " Each of *e three color profiles 
indicate the black border section 31 8 has very little color intensity. 

The remaining section of FIG. 3A is also shown in HQ's. 3B-D. Note that the color plateaus in these 
figures are jagged. This is due to the scanning process and the introduction of noise. Moreover, the oriqinal 
spot color image may have had various color variations. 

A 3 " D RGB Histogram of the original image in FIG. 3A is shown in FIG. 4 (also see step 502 of FIG 5) 
I u! < L COl J 0r 3X93 ar9 lab9 ' accordin 9"y and are broken down into 32 increments. The origin (0,0 0) is on 
the right hand side of the figure. The "clouds" in FIG. 4 represent the number of occurrences of the four 

S2 , S h Jno ' " ' 3nd yellOW ■ ° f th8 CrOSS section 310 of the ori 9 inal >™9* in FIG. 3A. The cloud 
labeled 402 represents the occurrences and intensity of the black pixels from the original. The black cloud 
» portioned near the origin. The cloud labeled 404 represents the occurrences and intensity of the red 
pixels in the original. The red cloud 404 is positioned adjacent the red axis. The cloud labeled 406 
represents the occurrences and intensity of the blue pixels in the original, and is positioned adjacent the 
blue axw The cloud labeled 408 represents the occupancies and intensity of the yellow pixels in the 

2" y r C ' 0ud 408 is Portioned in the foreground of the 4-D cube and adjacent the plane 
formed between the red and green axes. 

The black , red and blue clouds are difficult to see in the 2-D representation of a 3-D figure but thev do 

T^TTt^T ** ye "° W ° ,OUd - 11,6 COi °' S e^h clou'a are actuaiiy concentratea in the middle 
of the cloud. The outer sections of the clouds are lighted in color, which indicates that the number of 
occurrences of these outer shades are lower. The center of each cloud has a section of maximum 
occurrences. Note that all spatial information is lost by the histogram, because it only represents™© 
occurrence of the pixels having a specific color. 

The next task is to find the center of each cloud. This will represent the actual color that was to be 
represent by the corresponding spot color. 

■♦uloM h I ee l9aSt si9nificant bits are 'Snored and the histogram becomes 5-bid color. 0-31 on each side 

, , 2?J° tal entrieS ' EaCh °" e 0f entries is referred to as a " ce ""- Each cell represents 64 values in the 
original RGB space. In the search to identify the shades relating to a particular maximum occurrence if the 
number of occurrences goes down, then the cells is included until there are less than 10 occurrences in a 

n „ m J°7 tll 1 ally ' if A the u histogram has a peak, then a valley and another peak, the peak has a maximum 
th« £l ^ top °9 raph y is traversed, the cells will have lesser and lesser values in them until 

occurlr-1 I" y £ ^ th8 ^ StartS t0 9 ° UP the next sl °P e - and *• number of 

occurrences will go up. The search in that direction then stops, and all cells down until the valley are 
included. Going up again indicates that the search reaches another color cloud 

The system first looks for the cell with the maximum number of occurrences, see step 504 in FIG 5 A 
ce I 602 with a maximum number of occurrences is shown in FIG's. 6A and B. All adjacent cells with similar 
colors must now be determined. 

rtaf^S? C6 1I T immediate nei '9hbors (six sides per cell). Looking in each direction the system must 
determine whether or not to include cells with similar color and decreasing occurrences. FIG's 6A and B 
7ZT nt ^ Se f rCh ° nS ° f the SiX directions - Tne « r st Plane of nine cells is automatically included in the 

shown ^O^nRr r 3 ! £ 9 h 3 Sim : ,ar | CO,or Tne fi 9 ures show a sea «* 'n the positive red direction. As 
shown at 602 ,n FIG. 6A, the adjacent plane is included. Therefore, in the six directions a total of 26 cells 
would be automatically included. See step 506 in FIG 5. The asterisks in the cells indicate those cells that 
have a decreasing number of occurrences and have therefore been tagged. 

An offset is used for moving from plane to plane. All offset of one is a plane 602 adjacent to the 
maximum cell 601. An offset of two would be the next plane 604 in that direction, and so forth Thus the 
offset represents how far from the maximum cell the slices are searched through the RGB histogram ' 

An offset cut-off is also included. The offset cutoff in any one direction is six planes away from the 
maximum cell. The maximum number of cells in a cluster would therefore be 12x12x12. The cutoff mav be 
user defined. Typically, in an RGB histogram the number of occurrences in the search decreases very 
rapidly and a cut-off is not necessary, or rarely reached. 

An offset is incremented, as shown at 508, and the next plane in the positive red direction is evaluated 
as shown at 510. Then the red plane in the opposite direction is evaluated, as shown at step 512 This 

£St^^*T^«!T T b ' Ue P ' aneS (i e " b0th P ° Sitive and ne9ative directions a re searched). 
See steps 514-520. The offset .s then incremented (see 522) for evaluating the next set of planes. At step 

524 the system stops searching if no further matches occur (e.g.. the number of occurrences is less than 10 
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Once the cells in the cloud are resolved, the average color for that cloud is determined usinq color 
averaging techniques well known to those of ordinary skill in the art. The result is then recorded in a palle 
of colors representing the colors most likely to have been the original spot color of the image. See step 526 

After the first cloud's color is recorded in the pallet, all the cells corresponding to the cloud are zeroed 
as shown in step 528. The next maximum cell is found and the cells adjacent to that cell are evaluated 

ext'Sed 9 ^ *" ^ The PreC6SS COntinues until a " S P°' ™>™ ™ 

nm™* reS ° lution t scan is Permed "ext. The high resolution scan is similar to the preview scan except 
only one scan line of information is processed at a time to conserve memory space. Again, 24-bit true color 
or 8-b.t greyscale data is obtained at between 1 00-400 dpi, for example 

Each line is optionally color filtered (i.e., averaged) to smooth out the data. If the image is below 75 dpi 
there is no need for filtering. If the lines are scanned at between 75-150 dpi, every 3 pixels are averaoed 

t^etsTth 1 r 5 ouo??h d d i, "7 6 ^ ™ ** 100 ^nTe^M. 

the first th ough third pixels are averaged to get an average value of the second pixel. The second throuqh 

i°riT are h aVera9ed I 0 yie ' d a " aVera " Va,U8 f ° r the third pixel - The ori 9 inal data set are used for the 
averaging and the new va ues are stored in a new data set. This averaging may be done on a per color 
basis by separating each of the R, G and B elements. This process is shown in steps 702-704 of FIG 7 

,„ _ Tne t color f reas ,n eacn l,ne are located using what Is called a =break out box", see step 606 of FIG 7 

20 The function of the break out box is described with reference to FIG's. 8A and B 

The breakout box can identify seven pixel states, as shown in FIG. 8A. A plateau 802 is detected if 

Id J2 ?h° X K a T eS T PiXe ' *° the neXt th6 Pixel 6xitS the box throu 9 h a side - ,f *e Pixel enters 
and exits the box through the bottom, a peak intensity 804 is detected. A valley 806 is detected if the pixel 

25 STcSSS?« £ EfiS^ The be9innin9 and end of a s,ope (808 ' 810) - as we " as c " 

The height of a break out box 816 represents an intensity tolerance T, and D represents the distance 
oZe tZTX th6 T baCk T fr ° nt ° f ^ b ° X m9aSUred in piXe,S " ThUS ' D " d T r °>™°« the Paramos 
?51 1600 dpi ^ mtenS,ty ° f 16 ' 9XamPle ' and ° ^ be S8t 10 3 f ° r °' 15 ° dpi ° r 4 for 

wwTSif i $h0WS ° ne . C0l0r u 8lement of one ima 9 Q line - c o'or intensity is shown on the Y-axis and the 
th f mp ' Xe,S : S Shown on the *-»«*■ The break out box is a fast and efficient method used to 

71™ h f S,ty T P ' Xel and h ° W th9 int8nsity chan9es alon 9 «» ,ine - ™« information is used to 
determine and store color areas of the line. 

m ^fT 818 iS d8t ! Cted 38 Sh ° Wn 31 b ° X 820 The intensity of the P |ateau is determined to be about 
hil£L Ik be f nn,ng a " d end P ixels are shown a, ong the X-axis. A box 822 represents the change of slope 
between the plateau 819 and a peak 824. An increasing slope is then detected, as shown at a box 826 The 

£T TTJ T P"i 824 iS d6teCted by 3 bOX 82a Tne intensit V value 22 ° ^ored for the peak 
? L second plateau is shown at 830. At a box 832 a valley 834 is recorded. The valley 834 is bounded 
by two plateaus 836 and 838. The line terminates at the end of a plateau 840 oounaea 

n.JT< h T eaU ' P6ak T d ""I" 8y> iS d8SCribed by 3 5,311 and an end P° int - and a «**• The color for a 

s^;:*: & — va,ue: for a va,iey ■ is th ° min ^ *» *• ■ * - «*- ^ * 

Correlating the areas of color is then performed by a method shown and described in FIG's 9 and 10 

r©sp©ctiv©ly. * 

FIG. 9 shows first-in-first-out (FIFO) buffer for matching color areas. The FIFO contains 7 lines, but not 

o 1T„L VT? i I r 6 iS °" e 6ntry int ° the F,F ° f ° r e3Ch line - The FIF0 is implemented as an array 
2£t h 3 ?T StrUCtUreS - Th8 8truc * ure8 d8fine * e ,ine ^ areas " Each ^"oture comprises a 
^tl hT n "l. C ° ° r (b6C3USe * he " ext C0l0r st3rt defines the end of the P^ious one), a flag to 
indicate whether the ares is matched with another area, and there an index into the color pallet The 
maximum number of areas that could be defined by line is set at a limit of 256. for example 

The line that is in the top of the FIFO is referred to as the resolved line. The second line in the FIFO is 
the current line being processed. The next line, and the one after that are used for the matching process 
The system and method are optimized for four lines in the FIFO, but this number may be varied to match 
system speed and efficiency goals. The process begins by adding a new line to the FIFO, see step 1002 

FIG. 9 represents the lines corresponding to a red spot in the middle of a white page. The resolved line 
contains only one area, which is all white. Its starting position is zero. There are no other areas in the 
resolved line, so the end of * will be end of the image. Process continues to the current line. The current 
line has one area that is white and starts at 0. A second area that is red and starts at 100, and third area is 
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white and starts at 200. 

The current line is evaluated from left to right, and each area is compared to an adjacent area in the 
resolved line. The color of the first white area in the current line is vertically compared to the color in the 
resolved line, because the resolved line only has one area. See step 1004. If the color is within the 
5 tolerance T of the pallet color of the resolved line, the first white area in the current line is flagged 
"matched", and process continues to the red area of the current line. 

The system first attempts to vertically match the red area, then the system tries to match horizontally. 
See step 1006. However, because there is no previously matched red area to match it to, this area is 
flagged "unmatched". 

10 The system then attempts to vertically match the next line with the current line. See step 1008. All the 
horizontal matching for the next line is then executed. See step 1010. Next, the system looks for any areas 
that have not been matched. It then checks if the unmatched areas will match up with areas on the next two 
lines. See step 1012. 

In some instances, an area of the current line may not match with an area below it. Such areas are 
is transient or false color. Rather than keeping the area and assigning it an appropriate pallet color, the system 
checks whether a line above it or below it is matched. If the line below it is not matched, the area is 
mapped to the above area. If the line below it is matched, a determination is made as to which area its color 
is most similar. It is then mapped to the area with the closest color. See step 1014. 

The"^system~ then combines aii matched areas. Next, the average coior of the combined areas is 
20 determined, using known methods. The average color is compared to the pallet colors and the closest color 
is set for each area. This process is shown in step 1016. 

The method of the present invention for updating a data structure to keep track of the extracted spot 
colors will now be addressed. 

A somewhat more complex red dot on a white page is shown in FIG. 11. Two paths are defined for the 
25 red dot: a red path 1102 and a white path 1104. 

To update the data structure, the information from the resolved lines must be converted into the path 
1102 that follows the left hand side of the dot and the path 1104 that follows the right hand side. A data 
structure representing the edges has some significant advantages: specifically for doing the jaggy removal, 
as will be discussed below. 

30 One of the main reasons for converting this data structure is to conserve systems memory. The present 
invention converts a spot color into a data structure that is much more compact than the original image; it 
can operate on it in a way that is much faster and then quickly output the image. 

When a line is complete it is outputted from the FIFO, and either a new path is created or the current 
path(s) must be flagged so that they can be updated, as shown at step 1202 of FIG. 12. The first (or next) 

35 area from the FIFO line is obtained, see step 1204, and the system attempts to match that area to an 
existing path (see step 1 206). To do this, the system checks if there is a path that starts and ends with an 
overlap to the new area. 

Step 1208 shows the case in which the system will bypass a path. In such a case, a bypassed path 
indicates that the path has ended because there was no update. The non-updated path is then terminated. 
40 See step 1208. If the path has only one line once it is terminated, it is then eliminated altogether. The 
system removes horizontal jaggies automatically by removing paths that are setermined to be only one 
pixel high. 

Matched paths are then updated. If a path is not matched, or if there was no match on the previous line 
of the current set of paths, then the system will create a new path that represents that new area. See step 
45 1210. 

The data structure continues to be updated while there are areas in the line to assign paths. Thus, if 
areas still remain the system returns to step 1204, as shown in step 1212. 

The five (5) entries that make up each element of the data structure are the flag, pallet color, vertical 
start position, number of lines and the horizontal position for each line, 
so A path can be: empty (e.g., just an unused data structure); possible (e.g., an attempt at a path, or just 
one line); real (e.g., a real path with mode than one line); or terminated (e.g., a completed path). 

Prior to output of the spot color extracted image, additional processing may be done. 

An example of a vertical jaggy is shown at 1302 of FIG. 13. This jaggy is typically an artifact created by 
the scanner. Jaggies on vertical lines are removed by looking for changes in the vertical direction of one 
55 pixel to the right or left and then a one pixel change in the opposite direction. The area between the single 
pixel changes is then set to the value before the change. The following pseudo code is a simple example of 
the vertical jaggy removal technique employed by the present invention: 
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while the next pixel is to the right of the current pixel 
current pixel = next pixel 
5 while next pixel is equal to the current pixel 

current pixel = next pixel 
if the next pixel is one pixel to the left of the current pixel 

70 

top of jaggy = current pixel 
current pixel = next pixel 
^ while next pixel is equal to the current pixel 

if next pixel is one pixel to the right of the current 
pixel 

2 0 set pixels from top of jaggy to current pixel to 

the top of jaggy pixels value. 

All example of a horizontal skew is shown at 1402 of FIG. 14A. Skew in the horizontal direction is 
25 removed by looking for paths with sequences of large changes in the horizontal direction. The values along 
the skew are then set to the values before the skew and after the skew, as shown at 1403 in FIG. 14B. Note 
that the value of 7 was changed to 4 and 1 4 was changed to 20. 

An example of a vertical skew is shown at 1502 of FIG. 15A. Skew in the vertical direction is found by 
looking for long sequences of the same horizontal position with a change of one pixel between each 
30 sequence. Vertical skew is removed by setting all positions to the average value of the start of the skew and 
the bottom of the skew, as shown at 1504 in FIG. 15B. 

An example of noise is shown at 1602 of FIG. 16. Noise is any change of one pixel with a change of 
one pixel in the opposite direction on the next line. Noise is removed by setting the pixel position that 
changed to the value before it, as shown at 1602 in FIG. 16. 
35 Many skew and noise removal techniques are commonly known in the field, one versed in image 
processing will readily be able to adapt standard routines to the spot color extraction system of the present 
invention. 

Finally, an output sequence for the present invention is shown at FIG. 17. The spot color extracted 
image is recreated in the reverse process of the path formation. The output process begins by setting the 
40 current line equal to zero, as shown in step 1702 of FIG. 17. If any path(s) start on this line, they are added 
to the current path(s), see step 1704. The image line is then created based on the current path(s), see step 
1706. As shown at step 1708, if any paths terminated on this line, they are deleted from the current path(s). 
The current line is incremented, and the process loops back to step 1704 until the last line is reached. See 
steps 1710 and 1712, respectively. 

45 

Claims 

1. A method for finding areas of similar colour in electronically captured spot colour images and replacing 
the simitar colours with a single dominant colour, comprising the steps of finding a pallet of dominant 
so colours from an electronic image having individually scanned lines, processing said scanned lines to 
determine spatially adjacent areas of said image with similar colours and noting the colour, and 
comparing said coloured areas with said pallet of colours and replacing said noted colour of said areas 
with the closest corresponding pallet colour. 

55 2. A method according to claim 1 comprising the steps of: 

(1) scanning an original image at a first resolution (202) to generate an electronic image; 

(2) classifying colours in said image using a three dimensional histogram (204); 

(3) identifying a pallet of dominant colours based on said histogram classification (204); 
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(4) scanning a plurality of lines of said original image at a second resolution (206), said second 
resolution being substantially higher than said first resolution; 

(5) processing said scanned lines to determine where within each line different colours exist (208), 
the locations of said different colours being noted; 

(6) correlating segments of a particular line with other segments of adjacent lines to determine if said 
noted colours are in similar locations to identify a spatial area of colour representing said original 
(210); and 

(7) comparing said spatial areas with said pallet of colours and replacing said noted colour of said 
spatial areas with the closest corresponding pallet colour (212). 

3. A method according to claim 2, wherein said correlating step further comprises using a break out box 
for processing each line to determine related areas of the line based on colour threshold levels of 
adjacent pixels. 

75 4. A method according to claim 3, wherein the processing of each line further comprising the steps of: 
identifying and noting occurrences of colour transitions; and 
identifying and noting occurrences of colour plateaux. 

" ■ 6. A method according' to claims 3 or 4, further comprising tne step br'coibur averaging each lihe to" 
20 reduce the variance of colour per line. 

6. A computer system (102) for finding areas of similar colour in electronically captured spot colour 
images and replacing the similar colours with a single dominant colour, the system comprising: 

identifying means for finding a pallet of dominant colours from an electronic image having 
25 individually scanned lines; 

processing means for processing said scanned lines to determine spatially adjacent areas of said 
image with similar colours and noting the colour; and 

comparing means for comparing said coloured areas with said pallet of colours and replacing said 
noted colour of said areas with the closest corresponding pallet colour. 

30 

7. A system according to claim 6, wherein said identifying means further comprises: 

scanning means (134) for scanning said original image at a first resolution to generate said 
electronic image; 

classifying means for cataloguing colours in said image using a three dimensional histogram; and 
35 second scanning means (134) for scanning a plurality of lines of said original image at a second 

resolution, said second resolution being substantially higher than said first resolution. 

8. A System according to claim 7, wherein second scanning means only scans a portion of said original. 

40 9- A system according to claims 5 or 6, wherein said processing means further comprises means for 
correlating segments of a particular line with other segments of adjacent lines to determine if said noted 
colours are in similar locations to identify a spatial area of colour representing said original. 

10. A system according to claim 8, wherein said correlating means further comprises a break out box (816) 
45 for processing each line to determine related areas of the line based on colour threshold levels of 

adjacent pixels. 

11. A system according to any of claims 5 to 10, wherein said processing means further comprises: 

first means for identifying and noting occurrences of colour transitions in each line; and 
50 second means for identifying and noting occurrences of colour plateaux in each line. 
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